Non-linear PLS using Genetic Programming

نویسنده

  • Dominic Searson
چکیده

The economic and safe operation of modern industrial process plants usually requires that accurate models of the processes are available. Unfortunately, detailed mathematical models of industrial process systems are often time consuming and expensive to develop. Consequently, the use of data based models is often the only practical alternative. The need for effective methods to build accurate data based models with a minimum of specialist knowledge has given impetus to the research of automatic model development methods. One method, genetic programming (GP), which is an evolutionary computational technique for automatically learning how to solve problems, has previously been identified as a candidate for automatic nonlinear model development. GP has also been combined with a multivariate statistical regression method called PLS (partial least squares) in order to improve its performance (GP-PLS). One version of this method, called GP_NPLS2, was found to give good performance but at a computational expense deemed too high for use as a modelling tool. In this thesis, the GP-PLS framework is developed further. A novel architecture, called team based GP-PLS, is proposed. This method evolves teams of co-operating sub-models in parallel in an attempt to improve modelling performance without incurring significant additional computational expense. The performance of the team based method is compared with the original formulations of GP-PLS on steady state data sets from three synthetic test systems. Subsequently, a number of other modifications are made to the GP-PLS algorithms. These include the use of a multiple gene sub-model representation and a novel training method used to improve the ability of the evolved models to generalise to unseen data. Finally, an extended team method that encodes certain PLS parameters (the input projection weights) as binary team members is presented. The extended team method allows the optimisation of the sub-models and the projection weights simultaneously without recourse to computationally expensive iterative methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Ghotour-Chai River’s Rainfall-Runoff process by Genetic Programming

Considering the importance of water and computing the amount of rainfall runoff resulted from precipitation in recent decades, using appropriate methods for predicting the amount of runoff from rainfall date has been really essential. Rainfall-runoff models are used to estimate runoff generated from precipitation in the catchment area. Rainfall-runoff process is totally a non-linear phenomenon....

متن کامل

On the optimization of Dombi non-linear programming

Dombi family of t-norms includes a parametric family of continuous strict t-norms, whose members are increasing functions of the parameter. This family of t-norms covers the whole spectrum of t-norms when the parameter is changed from zero to infinity. In this paper, we study a nonlinear optimization problem in which the constraints are defined as fuzzy relational equations (FRE) with the Dombi...

متن کامل

Presentation and Solving Non-Linear Quad-Level Programming Problem Utilizing a Heuristic Approach Based on Taylor Theorem

The multi-level programming problems are attractive for many researchers because of their application in several areas such as economic, traffic, finance, management, transportation, information technology, engineering and so on. It has been proven that even the general bi-level programming problem is an NP-hard problem, so the multi-level problems are practical and complicated problems therefo...

متن کامل

Modeling Ghotour-Chai River’s Rainfall-Runoff process by Genetic Programming

Considering the importance of water and computing the amount of rainfall runoff resulted from precipitation in recent decades, using appropriate methods for predicting the amount of runoff from rainfall date has been really essential. Rainfall-runoff models are used to estimate runoff generated from precipitation in the catchment area. Rainfall-runoff process is totally a non-linear phenomenon....

متن کامل

QSAR models for CXCR2 receptor antagonists based on the genetic algorithm for data preprocessing prior to application of the PLS linear regression method and design of the new compounds using in silico virtual screening.

The CXCR2 receptors play a pivotal role in inflammatory disorders and CXCR2 receptor antagonists can in principle be used in the treatment of inflammatory and related diseases. In this study, quantitative relationships between the structures of 130 antagonists of the CXCR2 receptors and their activities were investigated by the partial least squares (PLS) method. The genetic algorithm (GA) has ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005